Tutorial for ISIMP-2001 Recent Developments in Advanced Audio Processing
Abstract
As DVD and home theater systems grow more popular, high-fidelity multichannel (5.1-channel or 10.2-channel) audio systems have been well received in the market. Compared with traditional mono or stereo audio, multichannel audio requires a much more efficient coding scheme for storage and transmission. This talk presents two new multichannel audio coding techniques: (i) the use of the Karhunen-Loeve Transform (KLT) to decorrelate the signals of multiple channels; and (ii) the use of bit-layer coding to achieve a fully embedded bitstream. We exploit the inter-channel redundancy inherent in most multichannel audio sources and prioritize the transmission of the transformed channels. Experimental results show that, compared with the MPEG AAC (Advanced Audio Coding) algorithm, the proposed MAACKL (Modified Advanced Audio Coding with KLT) algorithm not only reconstructs multichannel audio material with better quality at the regular low bit rate of 64 kbit/s per channel, but also achieves quality scalability within a single multichannel audio bitstream. The embedded multichannel audio coding system inherits the efficient inter-channel decorrelation block of the MAACKL algorithm and adds a progressive quantization coding block and a context-based QM noiseless coding block. The final bitstream generated by this embedded system is fully progressive, so encoding or decoding can be terminated at any arbitrary point. Experimental results show that, compared with the MPEG AAC algorithm, multichannel audio reconstructed by the proposed algorithm performs better at various bit rates, both in objective MNR (Mask-to-Noise Ratio) measurements and in subjective listening tests.
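The two ideas named in the abstract, inter-channel KLT decorrelation and bit-layer coding for an embedded bitstream, can be illustrated with a minimal NumPy sketch. This is not the MAACKL implementation: it leaves out the AAC filterbank, the psychoacoustic model, the progressive quantizer, and the context-based QM coder, and it applies the KLT to raw time-domain samples purely for illustration. All function names (`klt_decorrelate`, `klt_reconstruct`, `bit_layers`, `decode_layers`) are hypothetical.

```python
import numpy as np

def klt_decorrelate(x):
    """Apply an inter-channel KLT to x of shape (channels, samples).

    Returns the transformed channels (ordered by decreasing variance),
    the orthonormal basis (rows are eigenvectors), and the channel means.
    """
    mean = x.mean(axis=1, keepdims=True)
    centered = x - mean
    cov = centered @ centered.T / x.shape[1]   # inter-channel covariance
    eigvals, eigvecs = np.linalg.eigh(cov)     # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]          # most energetic channel first
    basis = eigvecs[:, order].T
    return basis @ centered, basis, mean

def klt_reconstruct(y, basis, mean):
    """Invert the KLT; the basis is orthonormal, so its inverse is its transpose."""
    return basis.T @ y + mean

def bit_layers(coeffs, num_bits):
    """Split integer magnitudes into bit planes, most significant layer first."""
    mags = np.abs(coeffs)
    return [(mags >> b) & 1 for b in range(num_bits - 1, -1, -1)]

def decode_layers(layers, signs, num_bits):
    """Rebuild coefficients from however many leading layers were received."""
    mags = np.zeros_like(signs)
    for i, plane in enumerate(layers):
        mags |= plane << (num_bits - 1 - i)
    return signs * mags

# Illustrative input: five channels sharing one strong common component.
rng = np.random.default_rng(0)
common = rng.standard_normal(4096)
x = np.stack([common + 0.1 * rng.standard_normal(4096) for _ in range(5)])

y, basis, mean = klt_decorrelate(x)

# Crude uniform quantization of the first transformed channel (an assumption
# for this sketch), followed by layered coding of the integer magnitudes.
coeffs = np.round(y[0] * 100).astype(np.int64)
num_bits = max(1, int(np.abs(coeffs).max()).bit_length())
layers = bit_layers(coeffs, num_bits)
coarse = decode_layers(layers[:4], np.sign(coeffs), num_bits)  # truncated stream
```

Because the transformed channels are ordered by variance and the layers by significance, cutting the stream early drops the least important information first, which is the property an embedded bitstream relies on.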
Similar resources
The Progress of Research on Signals and Systems in China of Year 1998~2001
Signals and systems is an important aspect of radio science. The recent progress of the research activities of signals and systems in China is reviewed in this paper. It covers the most advanced and important developments during years of 1998~2001. The review is based on the following categories: wideband CDMA, software radio, digital audio broadcasting technologies, information processing, rad...
Enhancing the Interoperability of iSimp by Using the BioC Format
This paper reports the use of the BioC format in our sentence simplification system, iSimp, so that it could be seamlessly used in text mining pipelines. iSimp is designed to simplify complex sentences commonly found in the biomedical text, therefore bringing benefits to existing text mining applications that rely on t...
High-Fidelity Multichannel Audio Coding Second Edition
Preface Audio is one of the fundamental elements in multimedia signals. Audio signal processing has attracted attention from researchers and engineers for several decades. By exploiting unique features of audio signals and common features of all multi-media signals, researchers and engineers have been able to develop more efficient technologies to compress audio data. Although books on digital ...
A Tutorial Survey of Architectures, Algorithms, and Applications for Deep Learning
In this invited paper, my overview material on the same topic as presented in the plenary overview session of APSIPA-2011 and the tutorial material presented in the same conference (Deng, 2011) are expanded and updated to include more recent developments in deep learning. The previous and the updated materials cover both theory and applications, and analyze its future directions. The goal of th...
Tutorial: Public Engagement Through Audio Internet Experiments
This tutorial paper details experiences of four public engagement projects that have communicated acoustic science to lay audiences using web experiments. Recent developments in personal computers, the Internet and software platforms offer new and exciting opportunities for engaging publics because technologies routinely allow the reproduction of sound. The projects are psychoacoustic experime...
Publication date: 2001